A scale-free network hidden in the collapsing polymer 
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We show that the collapsed globular phase of a polymer accommodates a scale-free incompatibility 
graph of its contacts. The degree distribution of this network is found to decay with the exponent 
7 = 1/(2 — c) up to a cut-off degree dc oc L^~'^, where c is the loop exponent for dense polymers 
(c — 11/8 in two dimensions) and L is the length of the polymer. Our results exemplify how a 
scale-free network (SFN) can emerge from standard criticality. 
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The underlying network structures of the world-wide- 
web, power grids, social contacts, protein interactions, 
etc.m bear peculiar properties absent in random net- 
works. The graphs associated with them not only share 
a power law degree distribution commonly referred as 
"scale-free" , but also an unusually small diameter (a mea- 
sure of the average shortest distance between two nodes) 
and typically a high degree of clustering. The advan- 
tages/disadvantages of these qualities on the efficiency 
of the network have been thoroughly discussed in the 
context of epidemic spreading |^ and resilience against 
random or malicious attacks |^. 

The search for non-growing SFNs originating from 
equilibrium statistics and/or optimization has been fo- 
cus of attention recently 0, 0,0- ^^'^ Y^*' although 
some statistical models on SFNs have already been stud- 
ied there is, to our knowledge, no demon- 
strated link from conventional critical models to the crit- 
icality hinted by the network's scale- freeness. As shown 
here, this link may be established by associating a net- 
work with each microstate of the model. Such a network 
representation is also convenient for studying the sys- 
tem's complexity and may have wider applicability. For 
example, a recipe similar to ours was recently adopted for 
investigating the navigational complexity of cities . 

In this Letter, we demonstrate that a SFN is associated 
with the collapsed phase at low temperature, T, and pre- 
sumably also with the 0-point (T = Tg) of a polymer. By 
modelhng the collapsed polymer as a self-attracting self- 
avoiding walk on a square lattice we obtain the de- 
gree exponent in terms of a critical index known exactly 
for 2D. Numerical simulations confirm the analytically 
predicted degree exponent and show that the scale-free 
character of the network is a direct consequence of the 
polymer criticality. 

The configuration of a polymer chain of L monomers 
can be represented by an arc-diagram (see Fig. ^,b) 
which carries the same information as the contact map 
(isf. It is obtained when one imagines stretching the 
polymer into a horizontal straight line while pairs of 
monomers that were originally in contact are joined by 
arcs on the upper half plane (the diagram is drawn on a 
plane even though the polymer may be embedded in arbi- 
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FIG. 1: A polymer on a square lattice and its self-contacts 
together with the corresponding arc-diagram and the incom- 
patibility graph. 



trary dimensions). The arc-diagram is said to be planar 
if no two arcs cross each other, i.e., if there are no inter- 
penetrating contacts. Otherwise, a non-trivial incompat- 
ibility graph (IG) can be associated with the arc-diagram 
(Fig.^), such that, each arc (contact) represents a node 
on the IG and two nodes are connected by an edge if the 
corresponding arcs cross. More precisely, if the monomer 
units of the polymer are indexed 1..L and the contacts 
are labeled by indices of the involved monomer pair {i, j) 
with i < j, then the two nodes (ji,ji) and (^2,^2) of the 
IG are joined by an edge if and only if there exists an 
integer n satisfying ii < n < ji and 12 < < i2- 

The IG provides a representation for the complexity of 
the contact structure of a folded polymer with particular 
reference to the arrangement of pseudoknots. This rep- 
resentation is frequently used in RNA folding research 
17, 18, ,2M where pseudoknots are biologically active 
structural components. An empty IG (no edges, meaning 
no pseudoknots) is a common simplifying assumption in 
RNA prediction algorithms 20, 21, 22]. Alternatively, in 
case of the traveling salesman problem (TSP) the same 
complexity refers to the shortest path connecting a given 
number of cities on a map where repeated visitations of a 
neighborhood compounds to a contact. In d=:2, TSP has 
recently been conjectured to be in the same universality 
class as dense polymers . Rosvall et al. use a similar 
mapping from a city road-map to the IG, through which 
they quantify the information content [I3. 
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Our goal here is to show that the IG of a homopolymer 
at low temperatures is a scale-free network. To this end, 
we will approximate the true arc-diagram of the polymer 
with a random arc-diagram where the probability of hav- 
ing an arc between two monomers m distance apart along 
the chain is given by its asymptotic value for an infinite 
polymer, i.e., P(m) ~ m~'^. The exponent c is related 
to the return probability and equals d/2 for a random 
walk Its counterpart for the self-avoiding polymer 
is c = du—a^ [lEIi^- Note that, apart from the modified 
radius of gyration exponent v {v = 1/2 for the random 
walk) it additionally involves the four-leg vertex expo- 
nent (74 which, e.g., plays a key role in DNA physics |27| . 
The exponents v and 174 are known exactly or to high 
precision for all integer dimensions, except for T <Tg va 
3D 1^ . We will find below that the tail of the degree dis- 
tribution for the IG of such a random arc-diagram obeys 
the power law 

P{d)r.d--^ , (1) 

with 7 = 1/(2 — c) in an infinitely long polymer. 

Consider an arc-diagram of the polymer with the dis- 
crete monomer centers labeled by integers \..L. As dis- 
cussed above, we set the arc-length distribution to be 

P(m) = m-7(C(c,l)-C(c,i + l)) , (2) 

where the proper normalization is given in terms of the 
generalized (Hurwitz) zeta function 



C(c,a)^^(T 



(3) 



Throughout the paper we will take c > 1, as is the case 
for all systems of interest. We start by considering a 
single arc in the arc-diagram, corresponding to a loop of 
length m. We label the monomers lying on the loop as 
1 < i < TO— 1, so that the contacting monomer pair (0, m) 
which closes the loop is excluded. We want to calculate 
the probability distribution for the corresponding node 
in the equivalent incompatibility graph to have degree d. 

The probability that the considered arc crosses another 
one which has one of its legs at monomer i(l<i<m— 1) 
is 

_ l _C(c,i + 1) + C(c,TO -i + l) 



(4) 



C(c,l)-C(c,i + 1) 

Below, we fix the average density of contacts (which is a 
function of temperature only) through the arc fugacity, 
z, so that the arc density satisfies 

a = z [5(1 - z)-^/dz]/{l - z)-i = z/(l - z) . (5) 

The probability that out of the n arcs originating from 
the site i, di will cross the reference arc (0,to) is 

^i,ra{di) = 

00 

(1 - z) ^ z" -..^UcY^ [1 - 7r,,™(c)]"-''' . (6) 

n—di 



L=1600 
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FIG. 2: The plots for increasing L converge to the expected 
scaling function of Eg. llOH . The inset shows the degree dis- 
tribution for different chain lengths before scaling the axes. 



Now we can write down the probability that the reference 
arc crosses a total of d arcs as: 



Pm{d) 



(7) 



i=l 



The difficulty due to the restricted sum is overcome by 
shifting to the grand-canonical formulation with an arc- 
crossing fugacity, /x. If we define Pm{^J) to be the Laplace 
transform of Eq.jTI), then a further summation over all 
loop lengths to gives the Laplace transform P(/i, L) of 
the degree distribution we are after. Using Eas. (|6l7fl we 
obtain 



P(/x,L) = ^P(d,i)/ 

L m— 1 

^ E™"' n [1+^(1 -^K"]^' (8) 

m— 1 i—1 

We solved for P{d, L) numerically using Mathemat- 
ica. The probability distribution for the node degrees 
obtained exhibits a clear power law with a sharp finite 
size cut-off (Fig. 121 inset). The analysis of the data for a 
range of lengths L and several values of c gives the finite 
size scaling form: 



P(d,L) =d-l/(2-=)/(d/L2-c) 



(9) 



where f{x) is a scaling function determining the window 
of validity for the power law. 




x<0{l) 
otherwise. 



(10) 



When the two axes are normalized in accordance with 
Eq.©, we obtain a convincing asymptotic collapse for 
P{d, L) for increasing L (Fig. hence assuring the va- 
lidity of the assumed scaling form. 

It is clear from Eas. (|9ll0|l that the power law disap- 
pears when c > 2. This, in fact, is the case for a freely 
fluctuating self-avoiding coil in two and three dimensions. 
For convenience, we list below the value of the loop ex- 
ponent, c, for various cases: 



Temperature 


2D 


3D 


T <Te 


1.375 








1.571 


1.5 




T>Tg 


2.688 


2.22 


m 



For the low T phase in three dimensions no estimate of 
tT4 is available yet. In this case, simulations suffer from 
severe surface effects which add to the difficulties of sam- 
pling compact configurations. 

The random-graph approximation we employed above 
omits correlations among contacts in the IG graph. For 
example, we ignored the fact that the IG is bipartite 
in two dimensions Thus, the conclusions of the 

random-graph scenario are not guaranteed a priori to 
hold for true polymers and it is necessary to confirm them 
through numerical simulations. For this purpose, we gen- 
erated self-avoiding walks on a square lattice up to length 
L — 3200 by using the Pruned-Enriched Rosenbluth 
Method (PERM) a particularly efficient Monte- 

Carlo technique for interacting polymers. A nearest 
neighbor attractive interaction e was introduced in or- 
der to induce collapse 0| . For each sample we obtained 
the normalized degree distribution of its incompatibility 
graph and averaged them with their associated Boltz- 
mann weight (Fig. OJ. 

According to the table above and Eq.®, high T poly- 
mers do not have scale-free IGs. We first checked through 
computer simulations ai T > Tg that this indeed is true. 
In fact, the IG in this case is composed of many discon- 
nected subgraphs with an exponentially decaying distri- 
bution of number of nodes. Our preliminary results show 
that the c exponent controls the power-law distribution 
for the span along the polymer backbone of each isolated 
subgraph comprising the IG. 

According to Eas. l|Mn|l SEN should emerge when c 
gets smaller than 2, i.e. when we cross the 0-point. We 
confirm this by considering next the case where the scale- 
free nature of the IG is expected to be most easy to 
demonstrate, namely the T < Tg regime in two dimen- 
sions. The small loop exponent c in this regime renders 
a wider scaling region on the basis of Eq.©. The tem- 
perature was fixed to T/e = 1.0 < Tg/e ~ 1.54 [11], deep 
in the low T phase. 

The degree distribution for not too high IG node de- 
grees appears to be self-averaging, since we were able to 
identify the power law behavior even in a single sample. 




degree, d 

FIG. 3: Averaged degree distribution for the IG of a 
self-attracting self-avoiding walk of 400 (□),800(0), 1600(A), 
3200(») steps on square lattice at T/e = 1.0 < Tg/e. The solid 
curves are the model's predictions with c = du — a4, — 1.375. 

The data presented in Fig. |21 are averaged over at least 
1000 independent samples, mainly to determine the cut- 
off behavior. For comparison, the corresponding random- 
graph prediction obtained from Eq.® is also shown in 
the same figure with a solid curve. The curve and the 
data points are overlapped by a suitable vertical shift to 
compensate for the discrepancy between the two at very 
small degree numbers (d < 5): The window < d < 5 is 
irrelevant for scaling purposes, yet it accounts for a finite 
fraction of the probability integral. An incorrect predic- 
tion in this region (due to deviations from the asymptotic 
form in Eq.ljSJ) consequently results in a finite vertical 
shift on the right-hand side of the distribution. 

Fig. 13 shows the degree distributions obtained from 
simulations of chains of various lengths. The agreement 
with the random-graph estimate calculated above is also 
shown (solid line). Not only the degree exponent, 7 = 
1/(2 — c) = 1.6, which is given by the slope in the log-log 
plot, but also the L dependence of the finite size cut-off 
is well predicted by Eas. 1)91101) . 

Although the 6'-point, too, appears as a good candidate 
with a predicted network exponent of 7 = 2.33(2.0) for 
2D (3D), relatively longer polymers required in order to 
obtain a convincing power law scaling make simulations 
harder due to either self-trapping or strong logarithmic 
corrections, respectively. 

Most known examples of SFNs also exhibit a high de- 
gree of clustering. Our graphs being ideally bipartite, the 
standard clustering coefficient based on counting trian- 
gles vanishes (except for a small contribution from buried 
polymer ends). Instead, we considered the probability 
that two nodes have a common nearest neighbor given 
that they already share another one and checked how it 
differs from its expected value for two uncorrelated nodes 
with the same degree. The latter probability is given by 

. , {N-d,-l)...{N-d,-d,) 

PranAdud,)-!- _ J) _ 3) . . . (TV - ) ' ^^^^ 
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where N is the number of nodes in the subgraph excluding 
the two nodes with degrees di,d2- Note that the Ea. Hll() 
is symmetric under the exchange of degrees. We found 
that (P((ii, c?2) — P{di,d2)rand) — 0.16, indicating a high 
degree of clustering. A feature possibly due to the spatial 
proximity of the loops corresponding to the two nodes 
that share a common nearest neighbor. 

In conclusion, we showed that a collapsed polymer 
globule in two dimensions accommodates a scale- free net- 
work. By means of a model based on the critical prop- 
erties of the polymer, we conjecture exact exponents for 
the scaling of the degree distribution. The model predicts 
similar behavior also at the 0-point, but with different de- 
gree exponents. Resulting networks are scale-free upto a 
degree number obtained by finite size scaling. The cut- 
off is determined by a power of L which is the inverse of 
the degree exponent 7, such that the power law scaling 
is less prominent (cut-off is more severe) at higher values 
of 7. It is conceivable that a similar mechanism applying 
to other known examples of SFNs may serve as a sieve 
selecting low degree exponents. We note that Barabasi- 
Albert networks 1] exhibit similar finite size effects 

The connection established here between polymers at 
low temperature and the scale-free networks will prove 
fruitful in other related problems as well. One possi- 
ble connection is to the TSP [l^l, where the topology of 
the related IG could serve as a checkpoint for the con- 
jectured correspondence with dense polymers. One is 
also tempted to attribute to the IG structure of a folded 
biopolymer, such as a protein in its native state, a sig- 
nature of the folding kinetics. For example, since the 
IG gives a measure of how "inter-penetrating" the con- 
tacts are, a correlation between high-degree nodes and 
the earlier formed contacts or the folding nucleus 129^ is 
plausible. 

This work was supported by Italian MIUR through 
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